(57) A method for identifying compounds that alter higher order chromatin dependent chromosome
stability is based on determining the compounds' ability to
ful in therapy, in particular the therapy of human cancer.
Description



[0001] The invention relates to a method for identifying compounds influencing chromosome dynamics in eukaryotic
cells. In particular, the invention relates to the treatment and prevention of human conditions by modulating
higher order chromatin dependent chromosome stability.

[0002] Higher-order chromatin is essential for epigenetic gene control and for the functional organisation of
chromosomes. Differences in higher-order chromatin structure have been linked with distinct covalent modifications
of histone tails which regulate transcriptional 'on' or 'off' states (Grunstein, 1998; Turner, 1998; Strahl and
Allis, 2000) and influence chromosome condensation and segregation (Karpen and Allshire, 1997; Wei et al., 1999).

[0003] Histones constitute a highly conserved family of proteins (H3, H4, H2A, H2B, H1) which are the major
components of eukaryotic chromatin structure. Histones compact genomic DNA into basic repeating structural units,
the nucleosomes. In addition to their DNA packaging function, histones have been proven to be integral components
of the molecular machinery that regulates gene expression.

[0004] Post-translational modifications of histone N-termini, particularly of H4 and H3, are well documented and
have functionally been characterised as changes in acetylation (Grunstein, 1998; Turner, 1998; Strahl and Allis,
2000), phosphorylation (Wei et al., 1999) and, most recently, methylation (Chen et al., 1999; Strahl et al., 1999).
In contrast to the large number of described histone acetyltransferases (HATs) and histone deacetylases (HDACs),
genes encoding enzymatic activities that regulate phosphorylation (Sassone-Corsi et al., 1999) or methylation
(Chen et al., 1999) of histone N-termini are only beginning to be identified. Moreover, the interdependence of the
different histone tail modifications for the integration of transcriptional output or higher-order chromatin
organisation is currently not understood.

[0005] Overall, there is increasing evidence that the regulation of normal and aberrant cellular proliferation is
not only affected on the transcriptional level, but that also a higher level of regulation is involved, i.e. the
organisation of chromatin structure through the modification of histone molecules. The determination of the proteins
and the molecular mechanisms involved in histone modification will contribute to the understanding of the cellular
proliferation program and will thus shed light on the mechanisms involved in aberrant proliferation occurring in
tumour formation and progression (Jacobson and Pillus, 1999).

[0006] Genetic screens for suppressors of position effect variegation (PEV) in Drosophila (Reuter and Spierer, 1992)
and S.pombe (Allshire et al., 1995) have identified a subfamily of approximately 30-40 loci which are referred to
as Su(var) group (Wallrath, 1998) genes. Interestingly, several histone deacetylases (De Rubertis et al., 1996),
protein phosphatase type 1 (Baksa et al., 1993) and S-adenosyl methionine synthetase (Larsson et al., 1996) have
been classified as Su(var)s. In contrast, Su(var)2-5 (which is allelic to HP1) (Eissenberg et al., 1992),
Su(var)3-7 (Cleard et al., 1997) and Su(var)3-9 (Tschiersch et al., 1994; Schotta and Reuter, 2000) encode
heterochromatin-associated proteins. Su(var) gene function thus suggests a model in which modifications at the
nucleosomal level may initiate the formation of defined chromosomal subdomains that are then stabilised and
propagated by heterochromatic SU(VAR) proteins (Henikoff, 1997).

[0007] Su(var)3-9 is dominant over most PEV modifier mutations (Tschiersch et al., 1994), and mutants in the
corresponding S.pombe clr4 gene (Ivanova et al., 1998) disrupt heterochromatin association of other modifying
factors and result in chromosome segregation defects (Ekwall et al., 1996). Recently, human (SUV39H1) and murine
(Suv39h1 and Suv39h2) Su(var)3-9 homologues have been isolated (Aagaard et al., 1999). It has been shown that they
encode heterochromatic proteins which associate with mammalian HP1 (Aagaard et al., 1999). The SU(VAR)3-9 protein
family combines two of the most evolutionarily conserved domains of 'chromatin regulators': the chromo (Aasland and
Stewart, 1995; Koonin et al., 1995) and the SET (Tschiersch et al., 1994; Jenuwein et al., 1998) domain. Whereas
the 60 amino acid chromo domain represents an ancient histone-like fold (Ball et al., 1997) that directs eu- or
heterochromatic localisations (Platero et al., 1995), the molecular role of the 130 amino acid SET domain has
remained enigmatic. Overexpression studies with human SUV39H1 mutants indicated a dominant interference with
higher-order chromatin organisation that, surprisingly, suggested a functional relationship between the SET domain
and the distribution of phosphorylated (at serine 10) H3 (Melcher et al., 2000). The experiments of the present
invention show that mammalian SU(VAR)3-9 related proteins (human SUV39H1, murine Suv39h1 and murine Suv39h2) are
SET domain-dependent H3-specific histone methyltransferases which selectively methylate lysine 9 ("K9") of the H3
N-terminus. Methylation of K9 negatively regulates phosphorylation of serine 10 and reveals a 'histone code' that
appears intrinsically linked to the organisation of higher-order chromatin. (In the following, histone
methyltransferases are termed "HMTases" or, more generally, "MTases").

[0008] It was an object of the invention to gain further insight in the molecular pathways leading to histone
modifications and higher-order chromatin organisation in order to harness these findings as a target for interfering
with aberrant gene expression and genomic instability through chromosome missegregation and thus provide new cancer
therapies.

[0009] In particular, it was an object of the invention to investigate the function of members of the SU(VAR)3-9
protein family with the view to develop novel strategies to affect higher-order chromatin dependent chromosome
stability. Such strategies can be employed in therapies for the treatment of conditions in which aberrant gene
expression and genomic instability through chromosome missegregation are causally involved. (The term "chromosome
stability" implies successful segregation of chromosomes resulting in the maintenance of a stable karyotype).
[0010] To solve the problem underlying the present invention, in a first step bioinformatic techniques were applied.
Using the SET domains of the SU(VAR)3-9 protein family as a starting alignment, significant sequence and secondary
structure similarities (see Methods) to six plant protein methyltransferases were detected.

[0011] To investigate whether the SET domain of human SUV39H1 has enzymatic activity, histones were tested as
possible substrates for in vitro methylation. The obtained results demonstrate that SUV39H1 harbours an intrinsic
histone methyltransferase activity and suggest that this HMTase activity resides in the C-terminal SET domain.

[0012] Using recombinant proteins, both murine GST-Suv39h1(82-412) and the corresponding human SUV39H1
fusion protein [GST-SUV39H1(82-412)] were shown to be catalytically active. Short internal deletions were introduced
into the two conserved regions of the SET domain core in GST-SUV39H1(82-412), and additional mutants lacking
the C-terminal tail (ΔC-tail) or the SET-associated cysteine-rich region (Δcys) were generated. All mutant proteins
failed to demonstrate HMTase activity.

[0013] Although these results suggest a significant contribution by the cysteine-rich regions, their apparent absence
in the plant methyltransferases does not prevent catalytic activity. To investigate enzyme function of the SET domain
in more detail, point mutations were introduced into the most highly conserved motif. In vitro HMTase assays
indicated that all point mutations, with the exception of one, abolished enzymatic activity. Surprisingly, the latter
mutation resulted in a hyperactive enzyme with approximately 20-fold increased activity. The data obtained define
the 320HφφNHSC326 motif in the SET domain as an important catalytic site.

[0014] Because the SET domain is one of the most conserved protein motifs in chromatin regulators (Stassen et al.,
1995; Jenuwein et al., 1998), it was next analysed whether SU(VAR)3-9 family members or other SET domain proteins
contain HMTase activity. GST-fusion products of the extended SET domains of S.pombe CLR4 (Ivanova et al., 1998),
human EZH2 (Laible et al., 1997) and human HRX (Tkachuk et al., 1992) were generated that would correspond to
GST-SUV39H1(82-412). Interestingly, GST-CLR4(127-490) displayed pronounced HMTase activity at three- to five-fold
increased levels as compared to the recombinant SUV39H1 product, consistent with CLR4 carrying an arginine at the
hyperactive position. The results obtained from this analysis show, in agreement with the mutational analysis of
SUV39H1, that HMTase activity towards free histones appears to require the combination of the SET domain with
adjacent cysteine-rich regions, which is a quality found in only a restricted number of SET domain containing
proteins.

[0015] These experiments indicated that the HMTase activity of mammalian SU(VAR)3-9 related proteins is selective
for histone H3 under the chosen assay conditions. To examine this finding in more detail, in vitro methylation
reactions were performed with individual histones. It could be shown that H3 is specifically methylated by
GST-Suv39h1(82-412), whereas no signals are detected with H2A, H2B or H4. Methylation of H3 has been shown to occur
predominantly at lysine 4 in a wide range of organisms, as well as at lysine 9 in HeLa cells, although the
responsible HMTase(s) have yet to be defined (Strahl et al., 1999). To investigate the site utilisation profile of
Suv39h1, unmodified peptides comprising the wild-type H3 N-terminus and a mutant K9L peptide were tested as
substrates. Additionally, insulin and peptides comprising the N-termini of CENP-A (Sullivan et al., 1994) and
macroH2A (Pehrson and Fried, 1992) were included. These in vitro assays revealed selective methylation of the
wild-type H3 peptide. The data obtained also suggested that the H3 N-terminus is a preferred residue for
Suv39h1-dependent HMTase activity.

[0016] To more definitively determine this site preference, the wild-type H3 N-terminal peptide was in vitro
methylated by GST-Suv39h1(82-412), using S-adenosyl-[methyl-3H]-L-methionine. The labelled peptide, purified by
reverse-phase HPLC, was then directly microsequenced, and 3H-incorporation associated with each individual amino
acid was analysed. The results confirmed selective transfer of methyl-label to lysine 9, demonstrating that Suv39h1
is a highly site-specific HMTase for the H3 N-terminus in vitro.
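
By way of illustration only, the per-residue analysis described in [0016] can be sketched in Python as follows. The sketch assumes one radioactivity reading per sequencing cycle; the function name and the example counts are hypothetical and are not data from the Examples. Only the peptide sequence (SEQ ID NO:7) is taken from the text.

```python
# Minimal sketch: find the sequencing cycle carrying most of the methyl label.
# The counts used below are invented for illustration, not measured values.

H3_PEPTIDE = "ARTKQTARKSTGGKAPRKQL"  # wild-type H3 N-terminus (SEQ ID NO:7)


def labelled_residue(peptide, cpm_per_cycle):
    """Return (1-based position, residue, fraction of total counts)
    for the sequencing cycle with the highest radioactivity."""
    if len(cpm_per_cycle) != len(peptide):
        raise ValueError("one count per sequencing cycle is required")
    total = sum(cpm_per_cycle)
    if total <= 0:
        raise ValueError("no radioactivity recorded")
    idx = max(range(len(peptide)), key=lambda i: cpm_per_cycle[i])
    return idx + 1, peptide[idx], cpm_per_cycle[idx] / total


# Hypothetical readings: background of 50 cpm in every cycle, 5000 cpm in cycle 9.
example_cpm = [50] * len(H3_PEPTIDE)
example_cpm[8] = 5000

position, residue, fraction = labelled_residue(H3_PEPTIDE, example_cpm)
print(f"peak at {residue}{position}, {fraction:.0%} of total counts")
```

With these assumed readings the peak falls on residue 9 of the peptide, which is a lysine in SEQ ID NO:7.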
[0017] Murine Suv39h genes are encoded by 2 loci, Suv39h1 and Suv39h2. To investigate the in vivo significance
of Suv39h function and Suv39h dependent K9 H3 methylation, mouse strains deficient for both Suv39h1 and Suv39h2
were generated. Suv39h1 and Suv39h2 deficient strains were intercrossed to produce Suv39h double deficient mice.
Double mutant mice were born in sub-Mendelian ratios. Some double null embryos exhibited severe growth retardation
and exencephaly. In addition, surviving double mutants were growth retarded, suggesting a role for Suv39h in cell
proliferation.

[0018] In order to determine whether the embryonic phenotypes in Suv39h null mice can be attributed to mitotic
defects, PMEFs (primary mouse embryonic fibroblasts) derived from Suv39h double null mice were analysed. Suv39h
double null PMEFs display a reduced G1-index and an increased proportion of cells with aberrant nuclear
morphologies, reminiscent of division defects during mitosis. Furthermore, double null cells also show genomic
instabilities and readily become aneuploid. The severity of these aneuploidies increases with higher passage
numbers. The inability of Suv39h double null cells to maintain a stable karyotype may underlie the Suv39h embryonic
phenotype.
[0019] Phosphorylation at serine 10 in the N-terminal tail of H3 (phosH3) has been shown to be required for
condensation and subsequent segregation of chromosomes (Wei et al., 1999). During the cell cycle, phosH3 initiates
within pericentric heterochromatin in late G2 and then progresses along the entire chromosomes during mitosis
(Hendzel et al., 1997). It was found that in wild-type PMEFs, approximately 7% of the cells stain positive for the
characteristic, heterochromatin-associated phosH3 foci. In contrast, this number is increased about 3-fold in Suv39h
double null PMEFs. This result suggested that the overall levels of phosH3 may be enhanced in Suv39h double null
PMEFs. This was confirmed biochemically. Together, the obtained data are most consistent with a model in which
Suv39h-mediated methylation of lysine 9 in H3 negatively regulates phosphorylation of serine 10.
[0020] Together, these data clearly demonstrate crucial roles for Suv39h during cell division. Loss of Suv39h
function impairs K9 histone H3 methylation and induces defective cell division resulting in genome instabilities.
Segregation defects and genome instability underlie the aetiology of many human cancers (Lengauer et al., 1997) and
are often a prerequisite for tumour progression. These observations make Suv39h an excellent candidate for novel
therapeutic approaches for tumour therapies.

[0021] The results obtained in the experiments of the present invention show that members of the SU(VAR)3-9 
protein family have HMTase activity which identifies them as novel targets for the therapy of proliferative disorders, in 
particular cancer. 

[0022] The identification of members of the SU(VAR)3-9 protein family, exemplified by human SUV39H1, murine
Suv39h1 and murine Suv39h2, as K9 specific histone H3 MTases is the prerequisite for designing assay methods that
allow for finding compounds altering, in particular interfering with, higher order chromatin dependent chromosome
stability, which is the basis for novel therapeutic approaches.

[0023] The present invention relates to a method for identifying a compound that alters higher order chromatin
dependent chromosome stability, said method comprising incubating a substrate for a methyltransferase, in the
presence of a methyl donor, with a MTase with Suv39h-like MTase activity, in the presence or absence of a test
compound and determining whether the compound modulates the MTase activity.

[0024] The group of MTases with Suv39h-like MTase activity (in the following also termed "Suv39h-like MTases") 
encompasses enzymes which display K9 histone H3 MTase activity or methyltransferase activity for other yet to be 
identified substrates. 

[0025] The term "K9 histone H3" is not limited to the human SUV39H or mouse Suv39h substrate (i.e. the methylation
site at lysine 9 of histone H3), but is meant to encompass any substrate of the histone or histone variant-type of
protein the methylation of which results in the below-defined epigenetic signal.

[0026] Additional members of the group of MTases can be identified by bioinformatic/biochemical techniques and
tested biochemically. By way of example, in a first step, databases are searched for similarities, as described in
Example 1. Next, an identified candidate can be verified as a MTase with Suv39h-like MTase activity in biochemical
assays similar to or identical with those described in the Examples.

[0027] This group of Suv39h-like MTases also encompasses MTases with specificities for other histone H3 residues
than K9 or for substrates other than histone H3, which are, like the Suv39h K9 histone H3 HMTase activity observed
in the present invention, required for higher order chromatin dependent chromosome stability. This epigenetic signal
may be a consequence of histone methylation at lysine 9 on H3 alone; however, it cannot be excluded that MTase
activity on undefined substrates or a combination of substrate methylation and other covalent modifications, such as
phosphorylation or acetylation, at other histone residues are involved.

[0028] In the experiments of the present invention, Suv39h variants with point mutations in the SET domain were
shown to confer hyperactive HMTase activity to the protein; these Suv39h variants may be advantageously used in
the method of the invention.

[0029] In a preferred embodiment, the MTase is mouse Suv39h1 or Suv39h2; most preferably, the MTase is human
SUV39H1 or SUV39H2.

[0030] Since it has been shown in the present invention that recombinant Suv39h retains HMTase activity, most
preferably, a recombinant MTase is employed. Suv39h or Suv39h variants can be produced recombinantly according
to standard methods by expression in suitable hosts, e.g. bacteria, yeast, insect or eucaryotic cells and purified,
e.g. on glutathione-agarose columns if it has been tagged with GST.

[0031] The Suv39h1 and SUV39H1 cDNA sequences are known from the literature (Aagaard et al., 1999), the
Suv39h2 cDNA sequence is shown in SEQ ID NO:1; the human SUV39H2 cDNA is defined by the ESTs as shown in
SEQ ID NO:3 - 6.

[0032] In the case of testing the compounds for their effect on Suv39h activity, the assay comprises, as its
essential features, incubating a histone H3 protein or histone H3 N-terminal fragment including K9 and a methyl
donor, S-adenosyl-L-methionine, with a preparation containing a Suv39h MTase activity and determining MTase activity
in the presence or absence of a test substance.

[0033] MTase substrates useful in the method of the invention may be those equivalent to or mimicking the naturally
occurring substrates, e.g. biochemically purified histone H3, recombinantly produced histone H3, or a histone H3
peptide that contains the K9 methylation site, or other yet to be identified proteins which act as substrates for
Suv39h MTases. Novel Suv39h substrates can be identified by bioinformatic and biochemical techniques and tested
using the biochemical assays described in the Examples of the present invention. For example, novel Suv39h
substrates can be identified by co-immunoprecipitation techniques. Suv39h proteins or tagged versions of Suv39h
proteins can be immunoprecipitated with specific antisera and interacting proteins identified by mass spectroscopy
techniques. A yeast two hybrid screen using Suv39h proteins or portions of Suv39h proteins as a bait can also be
employed to identify novel interacting proteins from a variety of cDNA libraries.
[0034] In a preferred embodiment, the histone H3 fragment ARTKQTARKSTGGKAPRKQL (SEQ ID NO:7) is employed.
Alternatively, a modified peptide may be used for which the MTase has increased affinity/activity. Such peptides can
be designed by exchanging and/or adding and/or deleting amino acids and testing the substrate in serial experiments
for MTase affinity/activity.
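
By way of illustration only, the design of modified peptides by exchanging amino acids can be sketched in Python as follows. The sketch enumerates single substitutions of SEQ ID NO:7 while leaving lysine 9 untouched; the function names and the choice to protect only position 9 are assumptions made for this example and are not prescribed by the text.

```python
# Minimal sketch: enumerate single-residue substitution variants of the H3 peptide.
# Deletions and additions, also mentioned in the text, are not covered here.

H3_PEPTIDE = "ARTKQTARKSTGGKAPRKQL"   # SEQ ID NO:7
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def substitute(peptide, position, new_residue):
    """Return the peptide with the residue at a 1-based position replaced."""
    if not 1 <= position <= len(peptide):
        raise ValueError("position outside peptide")
    return peptide[:position - 1] + new_residue + peptide[position:]


def single_substitutions(peptide, protected_positions=(9,)):
    """Yield (label, sequence) for every single substitution,
    skipping protected positions such as the K9 methylation site."""
    for pos in range(1, len(peptide) + 1):
        if pos in protected_positions:
            continue
        original = peptide[pos - 1]
        for aa in AMINO_ACIDS:
            if aa != original:
                yield f"{original}{pos}{aa}", substitute(peptide, pos, aa)


# The K9L control peptide used in the site-preference experiments.
k9l_peptide = substitute(H3_PEPTIDE, 9, "L")
candidates = dict(single_substitutions(H3_PEPTIDE))
print(k9l_peptide, len(candidates))
```

Each candidate would then be tested experimentally for MTase affinity/activity; the enumeration itself makes no prediction about which variants are better substrates.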

[0035] The methyl donor preferably carries a detectable label, e.g. a radioactive or a chromogenic label, which can 
be quantified upon transfer to the substrate. 

[0036] Preferably, the methyl donor is radio-labelled methionine or S-adenosyl-L-methionine. 
[0037] Alternatively to using a labelled methyl donor, the substrate, upon methylation by the enzyme, is used to
serve as an epitope which can be recognised by a specific antibody and hence be quantified by standard immunoassay
techniques, e.g. ELISAs. Antibodies useful in this type of assay can be obtained by using the methylated substrate,
preferably a small peptide, e.g. the peptide with the sequence shown in SEQ ID NO:7, as an antigen and obtaining
polyclonal or monoclonal antibodies according to standard techniques.
[0038] For small scale applications, the screening method can be based on an assay as described in Example 2, 3
or 4.

[0039] In a preferred embodiment, the method of the invention is performed on a high-throughput scale. For this
embodiment, the major assay components, in particular Suv39h, are employed in recombinant form.

[0040] For the high throughput format, the screening methods of the invention to identify MTase inhibitors are
carried out according to standard assay procedures. Such assays are based on the catalytic transfer, mediated by
Suv39h or a Suv39h variant, of a methyl group from a chromogenic substrate to a histone H3 peptide. To achieve this,
the substrate, e.g. histone H3 or a variant or fragment thereof, is immobilised on a carrier, usually a microtiter
plate, and incubated with recombinant Suv39h or a Suv39h variant and a methyl donor. The methyl donor preferably
carries a chromogenic or radioactive label. Upon transfer of the methyl group to the substrate by Suv39h, in the
case of a chromogenic reagent, the methyl donor changes colour which can be quantified. In the case of using a
radioactive methyl donor, the methyl group is transferred to the substrate and can be quantified.

[0041] Alternatively, the methylation of the substrate can be quantified by ELISA using an antibody specific for the
methylated substrate. If a test substance is a modulator of the MTase activity, there will be, depending on the
detection system and depending on whether the test substance has an inhibiting or an activating effect, a decrease
or an increase in the detectable signal.
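
By way of illustration only, the read-out described in [0040] and [0041] can be expressed relative to an uninhibited control well, as in the following Python sketch. It assumes a detection system in which the signal rises with substrate methylation (e.g. incorporated radiolabel or an ELISA signal); the function names, the enzyme-free background well and the 50% and 150% cut-offs are hypothetical and are not taken from the Examples.

```python
# Minimal sketch: score one microtiter well against plate controls.
# Assumes signal increases with methylation; cut-offs are arbitrary examples.


def percent_of_control(signal, background, uninhibited):
    """MTase activity in a test well as a percentage of the uninhibited control,
    after subtracting the background (no-enzyme) signal."""
    span = uninhibited - background
    if span <= 0:
        raise ValueError("uninhibited control must exceed background")
    return 100.0 * (signal - background) / span


def classify(signal, background, uninhibited,
             inhibit_below=50.0, activate_above=150.0):
    """Label a test substance by its effect on the detectable signal."""
    pct = percent_of_control(signal, background, uninhibited)
    if pct < inhibit_below:
        return "inhibitor"
    if pct > activate_above:
        return "activator"
    return "no effect"


# Hypothetical plate readings in arbitrary units.
print(percent_of_control(600, 100, 1100))
print(classify(200, 100, 1100), classify(2000, 100, 1100))
```

For a detection system in which the signal falls as methylation proceeds, the direction of the comparison would have to be reversed.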

[0042] In the high-throughput format, compounds with a modulating effect on Suv39h MTase activity can be identified
by screening test substances from compound libraries according to known assay principles, e.g. in an automated
system on microtiter plates.

[0043] By providing a method to identify compounds which exert their effect by directly modulating, in particular by
inhibiting, a Suv39h-like MTase, the present invention provides a basis for inhibiting the proliferation of rapidly
dividing animal cells, in particular tumour cells.

[0044] The compounds identified in the above methods, which are also subject of the invention, have the ability to 
interfere with chromosome stability and high fidelity chromosome segregation by modulating the MTase activity of 
Suv39h. 

[0045] In a preferred embodiment, the compounds of the invention are inhibitors of Suv39h HMTase activity.
[0046] Preferably, the compounds are specific modulators of Suv39h. 

[0047] The present invention also relates to compounds, which act as modulators of a Suv39h-like MTase activity, 
in particular modulators of Suv39h, for use in human therapy, in particular cancer therapy. 

[0048] Compounds inhibiting Suv39h HMTase activity result in decreased genome stability and can be used in therapy,
optionally in combination with other genome destabilising agents, to target highly proliferative tumour cells.
Likewise, agents which enhance Suv39h HMTase activity can be used to stabilise the genome of inherently unstable
cells, rendering them less prone to acquiring proliferation promoting mutations. A model for Suv39h function and
effects of inhibition or enhancement of Suv39h enzymes is shown in Figure 8.

[0049] Compounds inhibiting Suv39h HMTase activity result in decreased genome stability and can be used in therapy,
optionally in combination with other genome destabilising agents, to target highly proliferative tumour cells.
Likewise, agents which enhance Suv39h HMTase activity can be used to stabilise the genome of inherently unstable
cells, rendering them less prone to acquiring proliferation promoting mutations.

[0050] Compounds identified as Suv39h modulators can be tested for in vivo efficacy in mammalian cells, with Suv39h
double null cells serving as a positive control. Effective compounds should interfere with chromosome stability and
segregation, which can be measured by karyotyping, e.g. by analysing the DNA content by FACS or standard cytological
techniques. Substances whose potential for therapeutic use has been confirmed in such secondary screens can be
further tested for their effect on tumour cells. To test the inhibition of tumour cell proliferation, primary human
tumour cells are incubated with the compound identified in the screen and the inhibition of tumour cell
proliferation is tested by conventional methods, e.g. bromo-desoxy-uridine or 3H-thymidine incorporation. Compounds
that exhibit an anti-proliferative effect in these assays may be further tested in tumour animal models and used for
the therapy of tumours.

[0051] Toxicity and therapeutic efficacy of the compounds identified as drug candidates by the method of the
invention can be determined by standard pharmaceutical procedures, which include conducting cell culture and animal
experiments to determine the IC50, the LD50 and the ED50. The data obtained are used for determining the human dose
range, which will also depend on the dosage form (tablets, capsules, aerosol sprays, ampules, etc.) and the
administration route (oral, buccal, nasal, parenteral or rectal). A pharmaceutical composition containing the
compound as the active ingredient can be formulated in conventional manner using one or more physiologically active
carriers and excipients. Methods for making such formulations can be found in manuals, e.g. "Remington
Pharmaceutical Sciences".
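
By way of illustration only, one simple way of estimating an IC50 from a dilution series is log-linear interpolation between the two concentrations that bracket 50% residual activity, as in the following Python sketch. The text does not prescribe any particular method; the function name and the example dose-response values are hypothetical, and a fitted sigmoidal curve would normally be preferred where enough data points are available.

```python
# Minimal sketch: IC50 by interpolation on a log10 concentration axis.
# Example values are invented; concentrations are in arbitrary units.
import math


def ic50_by_interpolation(concentrations, percent_activity):
    """Return the concentration giving 50% residual activity, or None
    if the measured series never crosses 50%."""
    if any(c <= 0 for c in concentrations):
        raise ValueError("concentrations must be positive")
    pairs = sorted(zip(concentrations, percent_activity))
    for (c_lo, a_lo), (c_hi, a_hi) in zip(pairs, pairs[1:]):
        if a_lo >= 50.0 >= a_hi and a_lo != a_hi:
            frac = (a_lo - 50.0) / (a_lo - a_hi)
            log_ic50 = math.log10(c_lo) + frac * (math.log10(c_hi) - math.log10(c_lo))
            return 10 ** log_ic50
    return None


# Hypothetical series: activity falls from 95% to 5% over three decades.
doses = [0.1, 1.0, 10.0, 100.0]
activity = [95.0, 75.0, 25.0, 5.0]
print(ic50_by_interpolation(doses, activity))
```

In this assumed series the 50% point lies halfway between 1 and 10 on the logarithmic axis, i.e. at roughly 3.16 concentration units.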

[0052] As Suv39h is required to maintain a stable karyotype, it can be considered as a tumour suppressor gene. If
Suv39h mutations prove to be a factor underlying cellular transformation events - which can be confirmed by analysis
of Suv39h double deficient mice for tumour development and progression - it can be expected that the re-introduction
of a wild type Suv39h gene by gene therapy results in increased genomic stability, delaying or inhibiting cancer
progression.

[0053] For gene therapy, the Suv39h DNA molecule may be administered, preferably contained on a plasmid in
recombinant form, directly or as part of a recombinant virus or bacterium. In principle, any method of gene therapy
may be used for applying Suv39h recombinant DNA, both in vivo and ex vivo.

[0054] Examples of in vivo administration are the direct injection of "naked" DNA, either by intramuscular route or
using gene guns. Examples of recombinant organisms are vaccinia virus or adenovirus. Moreover, synthetic carriers
for nucleic acids such as cationic lipids, microspheres, micropellets or liposomes may be used for in vivo
administration of nucleic acid molecules coding for the Suv39h polypeptide.

[0055] Since it has been shown in the present invention that Suv39h mediates dynamic transitions in higher-order
mammalian chromatin in part through its intrinsic HMTase activity, K9 methylation of histone H3 (K9-Me) represents
an important epigenetic imprint for chromosome dynamics during cell division. Hence, antibodies specific for K9-Me
can be used to screen cells/patients for heterochromatin based genome instabilities. In essence, K9-methylation
specific anti-sera can be used as a diagnostic tool for several potential human diseases.

Brief description of figures:

[0056]

Figure 1: HMTase activity of transfected and recombinant SUV39H1 and Suv39h1 proteins.
Figure 2: Specific HMTase activity of the SET domain of mammalian SU(VAR)3-9 related proteins.
Figure 3: Lysine 9 of the H3 N-terminus is the major site for in vitro methylation by recombinant Suv39h1.
Figure 4: Targeting Suv39h1 and Suv39h2 in the mouse germline.
Figure 5: Analysis of Suv39h double null PMEFs.
Figure 6: Genomic instabilities in Suv39h double null PMEFs.
Figure 7: Increased phosH3 phosphorylation in Suv39h double null PMEFs.
Figure 8: Model for Suv39h function.

Materials and Methods

[0057]

a) Sequence alignments and secondary structure predictions 
The SET domains of human SUV39H1 (Aagaard et al., 1999), Drosophila SU(VAR)3-9 (Tschiersch et al.,
1994) and S.pombe CLR4 (Ivanova et al., 1998) were used as a multiple starting alignment for database similarity
searches using Profile (Birney et al., 1996), hidden Markov (Eddy, 1998) and position-specific iterative BLAST
(Altschul et al., 1997) methods (representative listings are available from the SET domain page of the SMART
www-server (Schultz et al., 2000)). These searches revealed significant similarities to six plant proteins (accession
numbers Q43088, O65218, P94026, O80013, AAC29137 and AC007576_12) described as putative lysine
N-methyltransferases. For example, a PSI-BLAST search with the S.pombe hypothetical protein SPAC3c7.09 as query
identified these plant sequences and well-known SET domain sequences within ten rounds using an E-value
inclusion threshold of 0.001. The same search also revealed the presence of a SET domain in YHR109w (which is
known to encode a cytochrome c MTase (Martzen et al., 1999)) within three rounds. Consensus secondary
structures were predicted by described algorithms (Frishman and Argos, 1997).
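
By way of illustration only, the role of the E-value inclusion threshold in an iterative search can be sketched in Python as follows. The sketch merely filters a table of hits by the 0.001 threshold stated above; the hit names and E-values are invented, the function name is hypothetical, and treating a hit exactly at the threshold as included is an assumption of this example. It does not reproduce the PSI-BLAST algorithm itself.

```python
# Minimal sketch: select the hits that would be carried into the next search round.
# Hit identifiers and E-values below are invented for illustration.

E_VALUE_THRESHOLD = 0.001  # inclusion threshold stated in the Methods


def included_hits(hits, threshold=E_VALUE_THRESHOLD):
    """Return the sorted names of hits whose E-value is at or below the threshold."""
    return sorted(name for name, e_value in hits.items() if e_value <= threshold)


example_round = {
    "hypothetical_hit_A": 2e-12,
    "hypothetical_hit_B": 4e-4,
    "hypothetical_hit_C": 0.03,
}
print(included_hits(example_round))
```

Only the sequences passing the threshold in one round contribute to the profile used for the next round, which is why a stringent threshold limits the inclusion of unrelated sequences.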

b) Epitope-tagged SUV39H1 proteins in HeLa cells

The HeLa cell lines overexpressing full-length (myc)3-SUV39H1 (aa 3-412) or (myc)3-Nchromo (aa 3-118)
have been described (Aagaard et al., 1999; Melcher et al., 2000). Nuclear extracts were immunoprecipitated with
anti-myc antibody beads (Aagaard et al., 1999), and approximately 1-3 µg of matrix-bound (myc)3-tagged
SUV39H1 proteins were used for in vitro HMTase assays.

c) Generation and purification of GST-fusion proteins

The GST-Suv1(82-412) product expressed from the pGEX-2T vector (Pharmacia) as a glutathione-S-transferase
(GST) fusion protein has been described (Aagaard et al., 1999). Additional GST constructs were generated
by transferring BamHI-EcoRI PCR amplicons into pGEX-2T, encoding in-frame fusions for Suv39h1(7-221),
SUV39H1(82-412), SUV39H1(82-378) ΔC-tail, SUV39H1(255-412) Δcys, Suv39h2(157-477) (O'Carroll et al.,
submitted), CLR4(127-490) (Ivanova et al., 1998), EZH2(382-747) (Laible et al., 1997) and HRX(3643-3969) (Tkachuk
et al., 1992). Short internal deletions (ΔNHSCDPN323-329 and ΔGEELTFDY358-365) or point mutations within the
320HφφNHSC326 motif were directly engineered in the GST-SUV39H1(82-412) plasmid by double PCR mutagenesis. All
constructs were confirmed by sequencing.

Recombinant proteins were expressed in 1 l cultures of E.coli strain BL21 and solubilized in 10 ml RIPA buffer
[(20 mM Tris pH 7.5, 500 mM NaCl, 5 mM EDTA, 1% NP-40, 0.5% sodium deoxycholate) containing a full set of
protease inhibitors (Boehringer Mannheim) and lysozyme (5 mg/ml; Sigma)] by freeze-thawing in liquid N2, followed
by sonication. Soluble proteins were cleared by centrifugation, purified with 800 µl glutathione Sepharose beads
(Pharmacia) and washed twice in RIPA buffer. Protein concentration was determined by Coomassie staining of
SDS-PAGE gels. Matrix-bound fusion proteins were used immediately for in vitro HMTase assays or stored at 4°C.

35 d) In vitro histone methy It ransf erase (HMTase) assay 

In vitro HMTase reactions were modified based on described protocols (Strahl et al., 1999) and carried out in
a volume of 50 µl of methylase activity buffer (MAB: 50 mM Tris pH 8.5, 20 mM KCl, 10 mM MgCl2, 10 mM β-ME,
250 mM sucrose), containing 10 µg of free histones (mixture of H1, H3, H2B, H2A and H4; Boehringer Mannheim)
as substrates and 300 nCi S-adenosyl-[methyl-14C]-L-methionine (25 µCi/ml) (Amersham) as methyl donor. 10 µg
of matrix-bound GST-fusion proteins were routinely used to assay for HMTase activity. After incubation for 60 min
at 37°C, reactions were stopped by boiling in SDS loading buffer, and proteins were separated by 15% or 18%
SDS-PAGE and visualised by Coomassie staining and fluorography.
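The reaction composition given above can be checked with a short calculation. The following Python sketch is illustrative only: the function and constant names are assumptions introduced here, and only the numerical values (50 µl reaction, 300 nCi of label from a 25 µCi/ml stock, 10 µg of histones) are taken from the description.

```python
# Illustrative arithmetic only. Names are hypothetical; values come from the text.

REACTION_VOLUME_UL = 50.0       # total reaction volume, µl
LABEL_ACTIVITY_NCI = 300.0      # methyl donor activity per reaction, nCi
LABEL_STOCK_UCI_PER_ML = 25.0   # label stock, µCi/ml
HISTONE_MASS_UG = 10.0          # free histones per reaction, µg


def label_volume_ul(activity_nci: float, stock_uci_per_ml: float) -> float:
    """Volume of label stock (µl) containing the requested activity.

    1 µCi/ml equals 1 nCi/µl, so the stock value can be used directly.
    """
    stock_nci_per_ul = stock_uci_per_ml
    return activity_nci / stock_nci_per_ul


def final_concentration(mass_ug: float, volume_ul: float) -> float:
    """Final concentration in µg/µl for a given mass in a given volume."""
    return mass_ug / volume_ul


if __name__ == "__main__":
    vol = label_volume_ul(LABEL_ACTIVITY_NCI, LABEL_STOCK_UCI_PER_ML)
    conc = final_concentration(HISTONE_MASS_UG, REACTION_VOLUME_UL)
    print(f"label stock per reaction: {vol:.1f} µl")
    print(f"histone concentration: {conc:.2f} µg/µl")
```

Under these assumptions, 300 nCi corresponds to 12 µl of the 25 µCi/ml stock, and the histone substrate is present at 0.2 µg/µl.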

HMTase assays with individual histones (Boehringer Mannheim), insulin (Sigma) or N-terminal peptides were
performed with 5 µg of substrate. The following peptides were used: wild-type N-terminus of human histone H3
(ARTKQTARKSTGGKAPRKQL) (SEQ ID NO:7) and mutant peptide which changes lysine 9 (bold) to leucine;
N-terminus of human CENP-A (MGPRRRSRKPEAPRRRSPSP) (SEQ ID NO:8) (Sullivan et al., 1994); N-terminus
of rat macro-H2A (MSSRGGKKKSTKTSRSAKAG) (SEQ ID NO:9) (Pehrson and Fried, 1992).
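The relationship between the wild-type H3 peptide and the K9L mutant peptide can be illustrated with a minimal Python sketch. The helper names `substitute` and `lysine_positions` are assumptions made for illustration; only the peptide sequence of SEQ ID NO:7 and the lysine 9 to leucine exchange are taken from the description.

```python
# Minimal sketch; helper names are hypothetical, the sequence is SEQ ID NO:7.

H3_WT = "ARTKQTARKSTGGKAPRKQL"  # wild-type H3 N-terminus, aa 1-20


def substitute(peptide: str, position: int, new_residue: str, expected: str) -> str:
    """Replace the residue at a 1-based position, checking the original residue."""
    index = position - 1
    if peptide[index] != expected:
        raise ValueError(
            f"expected {expected} at position {position}, found {peptide[index]}"
        )
    return peptide[:index] + new_residue + peptide[index + 1:]


def lysine_positions(peptide: str) -> list[int]:
    """Return the 1-based positions of all lysine (K) residues."""
    return [i + 1 for i, residue in enumerate(peptide) if residue == "K"]


H3_K9L = substitute(H3_WT, 9, "L", expected="K")

if __name__ == "__main__":
    print(H3_WT, lysine_positions(H3_WT))
    print(H3_K9L, lysine_positions(H3_K9L))
```

The mutant peptide retains the lysines at positions 4, 14 and 18, so a loss of signal with K9L points to lysine 9 rather than to a general loss of lysine residues.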

Peptide microsequencing of the in vitro methylated wild-type H3 N-terminal peptide and determination of
3H-incorporation of individual amino acids by scintillation counting was done as described (Strahl et al., 1999).

e) Generation of Suv39h1 and Suv39h2 deficient mice by gene targeting 

Suv39h1 maps to the X-chromosome. The cloned partial Suv39h1 genomic locus (O'Carroll et al., submitted)
was used to generate a targeting construct. A 1.2 kb Pfu PCR amplicon, generated with the primers gM3-9(SII) and
gM3-9(RI), was used as a short arm of homology and cloned in frame with the nls-lacZ gene of the pGNA-T vector.
This places the first 3 amino acids of exon 2 in frame with the nls-lacZ gene generating a fusion protein of the first
8 amino acids of Suv39h1 and lacZ from the targeted locus. A 5.4 kb SacI (filled in) fragment from Suv39h1 genomic
subclone gSuv39h1 #18 was used as a long arm of homology.

Suv39h2 is autosomal and maps to chromosome 2. The cloned partial Suv39h2 genomic locus (O'Carroll et
al., submitted) was used to generate a targeting construct. A 1.4 kb Pfu PCR amplicon, generated with the primers
Suv2SII and Suv2RI, was used as a short arm of homology and cloned in frame with the nls-lacZ gene of the
pGNA-T vector. This places the first 113 amino acids of exon 2 in frame with the nls-lacZ gene generating a fusion
protein of the first 113 amino acids of Suv39h2 and lacZ from the targeted locus. A 4.9 kb MluI/ApaI (filled in)
fragment from Suv39h2 genomic subclone gSuv39h2 #28 was used as long arm of homology to inactivate the locus.

These constructs were linearised with NotI, electroporated into R1 ES cells (Suv39h1) and E14.1 ES cells
(Suv39h2). ES cells were put under G418 selection and G418 resistant colonies screened for homologous
recombination by PCR and Southern blot analysis. Targeted feeder dependent ES cell clones were injected into
blastocysts of C57BL/6 mice and reimplanted into pseudopregnant females to produce chimeric offspring. Germ-line
transmission was obtained after a backcross between chimeric males and C57BL/6 females. Heterozygous mice
were interbred to obtain Suv39h1 and Suv39h2 deficient mice. Suv39h1 and Suv39h2 deficient mice were then
interbred to generate Suv39h double deficient mice.

f) Generation and analysis of Suv39h double null primary mouse embryonic fibroblasts (PMEFs) 

PMEFs were derived from day E12.5 Suv39h double null embryos obtained after intercrossing
Suv39h1-/-/Suv39h2+/- compound mutant mice. As controls, PMEFs were prepared from wild-type embryos of the
same genetic background. For cell cycle profiles and growth curve analysis, passage 2 PMEFs were analyzed as
described (Xu et al., 1999). Staining of PMEF interphase chromatin with anti-phosH3 (Hendzel et al., 1997)
antibodies was done in untreated cells. For the biochemical analysis, total nuclear extracts were precalibrated by
Ponceau staining, immunoblotted with anti-H3 (Upstate Biotechnology) and anti-phosH3 (Hendzel et al., 1997)
antibodies and visualised by peroxidase staining using Enhanced ChemiLuminescence (ECL) (Amersham).



[0058] Using the SET domains of the SU(VAR)3-9 protein family as a starting alignment, significant sequence and
secondary structure similarities (see Methods) to six plant protein methyltransferases were detected. Although some
of these plant sequences have been classified as potential histone lysine N-methyltransferases, only one had been
functionally characterised but was found to lack HMTase activity (Klein and Houtz, 1995; Zheng et al., 1998).

[0059] Detected were amino acid and secondary structure [β-sheet (b) or α-helix (h)] similarities of the C-terminal
halves of SET domain sequences from human SUV39H1 (Aagaard et al., 1999) (AF019968), murine Suv39h1 (Aagaard
et al., 1999) (AF019969), murine Suv39h2 (O'Carroll et al., submitted) (AF149205), Drosophila SU(VAR)3-9
(Tschiersch et al., 1994) (P45975), a C. elegans SU(VAR)3-9-like ORF C15H11.5 (CAB02737), S. pombe CLR4 (Ivanova et
al., 1998) (O74565), human EZH2 (Laible et al., 1997) (Q15910), the human trithorax homologue HRX (Tkachuk et al.,
1992) (Q03164), and MTases from P. sativum (rubisco ls-MT; Q43088) (Klein and Houtz, 1995; Zheng et al., 1998) and
A. thaliana (O65218). The plant MTase sequences contain an insertion of approximately 100 amino acids in the middle
of the SET domain.

Example 2

[0060] HMTase activity of transfected and recombinant SUV39H1 and Suv39h1 proteins. 

[0061] To investigate whether the SET domain of human SUV39H1 has enzymatic activity, histones were tested as
possible substrates for in vitro methylation. Using HeLa cell lines 'stably' expressing triple myc-tagged full-length
SUV39H1 (aa 3-412), the ectopic protein was enriched from nuclear extracts by immunoprecipitation with anti-myc
beads (see Figure 1/1A, arrowhead top panel) and probed for activity to transfer a labelled methyl group from
S-adenosyl-[methyl-14C]-L-methionine to free histones according to described conditions (Strahl et al., 1999). Reaction
products were separated by SDS-PAGE and visualised by fluorography, indicating selective transfer of the methyl-
label to H3 (Figure 1/1A, lower panel). By contrast, no signals were detected with extracts from a HeLa cell line that
expresses only the N-terminal third of SUV39H1 (aa 3-118) or with extracts from HeLa control cells. To confirm that
the HMTase activity is an intrinsic property of SUV39H1 and not mediated by a SUV39H1-associated factor, the in vitro
HMTase reactions were repeated with recombinant products that were purified as GST-fusion proteins from E. coli (see
Figure 1/2B, arrowheads top panel). For this analysis, murine Suv39h1, which is 95% identical to human SUV39H1
(Aagaard et al., 1999), was used. A purified GST-product comprising aa 82-412 maintained HMTase activity (although
at a reduced level as compared to transfected SUV39H1), whereas a purified GST-product comprising aa 7-221 proved
negative, even at higher protein concentrations (Figure 1/2B, lower panel). These results suggest that the HMTase
activity resides in the C-terminal SET domain.

[0062] In Fig. 1A, triple myc-tagged full-length human SUV39H1 (aa 3-412) or a C-terminally truncated SUV39H1
protein (aa 3-118) were immunoprecipitated from 'stably' transfected HeLa cell lines with anti-myc antibody beads and
used in in vitro HMTase reactions with free histones as substrates and S-adenosyl-[methyl-14C]-L-methionine as methyl
donor. The Coomassie stain (top panel) shows purified proteins by arrowheads and free histones by dots. Fluorography
(bottom panel, Figure 1A) indicates HMTase activity of (myc)3-SUV1(3-412).
[0063] In the experiments shown in Fig. 1B, recombinant GST-fusion proteins encoding different domains of murine
Suv39h1 were used in increasing protein concentrations for in vitro HMTase reactions as described above.

Example 3 

[0064] Definition of a catalytic motif in the SET domain of human SUV39H1.

[0065] Similar to the recombinant murine GST-Suv39h1(82-412) product, the corresponding human SUV39H1 fusion
protein [GST-SUV39H1(82-412)] is catalytically active (see Figure 2). Short internal deletions (ΔNHSCDPN323-329; ΔNHSC
and ΔGEELTFDY358-365; ΔGEEL) were introduced into the two conserved regions of the SET domain core in
GST-SUV39H1(82-412) and, in addition, mutants that lack the C-terminal tail (ΔC-tail) or the SET-associated cysteine-rich
region (Δcys) were also generated. All mutant proteins failed to demonstrate HMTase activity (see Figure 2/1A). To
investigate enzyme function of the SET domain in more detail, point mutations were introduced into the most highly
conserved motif. In vitro HMTase assays indicated that all point mutations, with the exception of one, abolished enzymatic
activity. Surprisingly, the latter mutation (H320R) resulted in a hyperactive enzyme with approximately 20-fold increased
activity. The data obtained define the 320HφφNHSC326 motif in the SET domain as an important catalytic site.

[0066] Specific HMTase activity of the SET domain of mammalian SU(VAR)3-9 related proteins.

[0067] Because the SET domain is one of the most conserved protein motifs in chromatin regulators (Stassen et al.,
1995; Jenuwein et al., 1998), it was next analyzed whether SU(VAR)3-9 family members or other SET domain proteins
contain HMTase activity. GST-fusion products of the extended SET domains of S. pombe CLR4 (Ivanova et al., 1998),
human EZH2 (Laible et al., 1997) and human HRX (Tkachuk et al., 1992) were generated that would correspond to
GST-SUV39H1(82-412) (see Figure 2/2B). Interestingly, GST-CLR4(127-490) displayed pronounced HMTase activity
at three- to five-fold increased levels (see Figure 2/2C) as compared to the recombinant SUV39H1 product, consistent
with CLR4 carrying an arginine at the hyperactive position. By contrast, both GST-EZH2(382-747) and GST-HRX
(3643-3969) had undetectable HMTase activity towards free histones (Figure 2/2C), whereas a comparable GST product
generated from the recently isolated murine Suv39h2 gene (O'Carroll et al., submitted), GST-Suv39h2(157-477),
was as active as GST-SUV39H1(82-412). EZH2 lacks the C-terminal cysteines, and HRX does not contain the SET
associated cysteine-rich region (Figure 2B). Both of these cysteine domains are present in CLR4, Suv39h2 and
SUV39H1. In agreement with the mutational analysis of SUV39H1, it thus appears that HMTase activity towards free
histones requires the combination of the SET domain with adjacent cysteine-rich regions, and is a quality found in only
a restricted number of SET domain containing proteins.

[0068] In Fig. 2A, approximately 10 µg of the indicated fusion proteins encoding GST-SUV1(82-412) (= human
SUV39H1) and seven SET domain mutants were used in in vitro HMTase reactions with free histones as outlined in
Figure 1. For the hyperactive H320R mutant, only 1 µg (10%) of the corresponding fusion product was used. Fig. 2B
shows a diagram representing the domain structures of CLR4, Suv39h2, SUV39H1, EZH2 and HRX proteins, with the
arrowheads demarcating the N-terminal fusion to GST. Cysteine-rich regions are indicated by grey stippling.

[0069] In Fig. 2C, approximately 10 µg of the indicated fusion proteins encoding S. pombe CLR4 [GST-CLR4
(127-490)], murine Suv39h2 [GST-Suv2(157-477)], human EZH2 [GST-EZH2(382-747)], human HRX [GST-HRX
(3643-3969)] and human SUV39H1 [GST-SUV1(82-412)] were used in in vitro HMTase reactions with free histones
as outlined in Figure 1.

Example 4

[0070] Lysine 9 of the H3 N-terminus is the major site for in vitro methylation by recombinant Suv39h1.

[0071] The above Examples indicated that the HMTase activity of mammalian SU(VAR)3-9 related proteins is
selective for H3 under the chosen assay conditions. To examine this finding in more detail, in vitro methylation reactions
were performed with individual histones, using GST-Suv39h1(82-412) as an enzyme. As shown in Figure 3/1A, H3 is
specifically methylated by GST-Suv39h1(82-412), whereas no signals are detected with H2A, H2B or H4. A weak
signal is present if H1 is used as the sole substrate; the significance of H1 methylation remains to be determined.
Methylation of H3 has been shown to occur predominantly at lysine 4 in a wide range of organisms, as well as at lysine
9 in HeLa cells, although the responsible HMTase(s) have yet to be defined (Strahl et al., 1999). To investigate the site
utilisation profile of Suv39h1, unmodified peptides comprising the wild-type H3 N-terminus (aa 1-20) and a mutant K9L
peptide, changing lysine 9 to leucine, were tested as substrates. Additionally, insulin and peptides comprising the
N-termini of CENP-A (Sullivan et al., 1994) and macroH2A (Pehrson and Fried, 1992) were included. Peptides were in
vitro methylated by GST-Suv39h1(82-412), and reaction products were separated by high percentage SDS-PAGE and
visualised by fluorography. These in vitro assays revealed selective methylation of the wild-type H3 peptide, whereas
no signals were detected with the CENP-A or macroH2A peptides, or with insulin (see Figure 3/2B). Importantly, the
mutated H3 (K9L) peptide was not a substrate, suggesting that lysine 9 of the H3 N-terminus is a preferred residue for
Suv39h1-dependent HMTase activity.
[0072] To more definitively determine this site preference, the wild-type H3 N-terminal peptide was in vitro methylated
by GST-Suv39h1(82-412), using S-adenosyl-[methyl-3H]-L-methionine. The labelled peptide, purified by reverse-
phase HPLC, was then directly microsequenced, and 3H-incorporation associated with each individual amino acid was
analysed by scintillation counting. The results confirmed selective transfer of methyl-label to lysine 9 (see Figure 3/2C),
demonstrating that Suv39h1 is a highly site-specific HMTase for the H3 N-terminus in vitro.

[0073] In Fig. 3A, approximately 10 µg of murine GST-Suv39h1(82-412) were used in in vitro HMTase reactions with
individual histones as outlined in Figure 1.

[0074] Fig. 3B shows the results of in vitro methylation assays using GST-Suv39h1(82-412) as enzyme and the
indicated N-terminal peptides of wild-type H3, mutated H3 (K9L), CENP-A, macroH2A or insulin as substrates.

[0075] Fig. 3C shows the result of automated sequencing of the wild-type H3 N-terminal peptide (aa 1-20) that had
been methylated in vitro by recombinant GST-Suv39h1(82-412). Displayed is the 3H-incorporation of individual amino
acids identified at each successive round of microsequencing.

Example 5 


[0076] Targeting the Suv39h1 and Suv39h2 loci in the mouse germline. 

[0077] Murine Suv39h genes are encoded by 2 loci, Suv39h1 and Suv39h2 (O'Carroll et al., submitted). To investigate
the in vivo significance of Suv39h function and Suv39h dependent K9 H3 methylation, mouse strains deficient for both
Suv39h1 and Suv39h2 were generated according to standard techniques. Figure 4 shows the targeting strategies and
demonstrates the production of null alleles for both Suv39h1 and Suv39h2. Mutation of either gene results
in viable and fertile mice as a consequence of functional redundancy between both loci. Therefore, Suv39h1 and
Suv39h2 deficient strains were intercrossed to produce Suv39h double deficient mice.

[0078] Double mutant mice are born in sub-Mendelian ratios: only approximately 20% of the expected double mutants
are observed. Some double null embryos exhibit severe growth retardation and exencephaly. In addition, surviving
double mutants are growth retarded, suggesting a role for Suv39h in cell proliferation.

[0079] Fig. 4 shows a conventional targeting strategy used to inactivate the X-linked Suv39h1 locus. Fig. 4B shows
the Northern blot analysis of Suv39h1 from spleen (Sp), liver (Li), kidney (Kidney), and brain (Br) from wild-type and
Suv39h1 null mice. Fig. 4C shows the conventional targeting strategy used to inactivate the autosomal Suv39h2 locus.
(Bottom panel) Western blot analysis with anti-Suv39h2 antibodies on protein extracts derived from wild-type and
Suv39h2 null testis.

Example 6 

[0080] Increased aneuploidies in Suv39h double null primary mouse fibroblasts (PMEFs). 

[0081] In order to determine whether the embryonic phenotypes in Suv39h double null mice can be attributed to
mitotic defects, PMEFs derived from Suv39h double null mice were analysed. Cell cycle profiles of wild-type and Suv39h
double null PMEFs indicated rather similar percentages of cells to be in S- and G2/M-phases (see Figure 5A), whereas
Suv39h double null PMEFs display a reduced G1-index and an increased proportion of cells with aberrant nuclear
morphologies, reminiscent of division defects during mitosis. For example, Suv39h double null PMEFs contain
approximately two-fold elevated numbers of cells with micro- and polynuclei, and are further characterised by cell
subpopulations with oversized nuclei or a weak definition of heterochromatin that appears in only a few unusually condensed
foci (see Figure 5B). Furthermore, double null cells also show genomic instabilities and readily become aneuploid. The
severity of these aneuploidies increases with higher passage numbers (see Figure 6). The inability of double null cells
to maintain a stable karyotype may underlie the Suv39h embryonic phenotype.

[0082] Fig. 5A shows the percentages of cells in various cell cycle stages of wild-type and Suv39h double null PMEFs.
[0083] Fig. 5B shows representative images (left and middle) of aberrant mitoses in Suv39h double null PMEFs
detected by β-tubulin (not shown) and DAPI staining. Also shown (right image) is a nucleus exemplifying the unusual
definition of heterochromatin in a subpopulation of Suv39h double null PMEFs. All images were taken at a magnification
of 630x.

[0084] Fig. 6A shows the DNA content profile of wild-type and Suv39h double null PMEFs at passage 3. Fig. 6B
shows the DNA content profile of wild-type and Suv39h double null PMEFs at passage 8.

Example 7

[0085] Increased phosH3 phosphorylation in Suv39h double null PMEFs. 

[0086] Phosphorylation at serine 10 in the N-terminal tail of H3 (phosH3) has been shown to be required for
condensation and subsequent segregation of chromosomes (Wei et al., 1999). During the cell cycle, phosH3 initiates within
pericentric heterochromatin in late G2 and then progresses along the entire chromosomes during mitosis (Hendzel et
al., 1997). In wild-type PMEFs, approximately 7% of the cells stain positive for the characteristic, heterochromatin-
associated phosH3 foci, as detected by indirect immunofluorescence with anti-phosH3 antibodies (see Figure 7A, right
panel). In contrast, this number is increased by a factor of about 3-fold in Suv39h double null PMEFs, with approximately
22% of the cells containing phosH3-positive foci (Figure 7A, left panel), although their definition appears in many small
speckles which do not always overlap with DAPI-dense material (data not shown). This result suggested that the overall
levels of phosH3 may be enhanced in Suv39h double null PMEFs. Therefore, the relative abundance of phosH3 in
precalibrated nuclear extracts was determined with anti-phosH3-specific antibodies. This quantitation indicated a
significantly higher level of phosH3 in Suv39h double null cells as compared to wild-type controls (see Figure 7B). Together,
the obtained data are most consistent with a model in which Suv39h-mediated methylation of lysine 9 in H3 negatively
regulates phosphorylation of serine 10.
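The stated increase "by a factor of about 3-fold" follows directly from the two percentages given above. A minimal Python sketch of this calculation is shown below; the function name `fold_change` is an assumption introduced for illustration, and the only inputs are the approximate values of 7% (wild-type) and 22% (Suv39h double null) reported in the text.

```python
# Illustrative calculation only; the function name is hypothetical.

WILD_TYPE_POSITIVE_PERCENT = 7.0     # approx. % of wild-type PMEFs with phosH3 foci
DOUBLE_NULL_POSITIVE_PERCENT = 22.0  # approx. % of Suv39h double null PMEFs


def fold_change(test_percent: float, control_percent: float) -> float:
    """Ratio of the test percentage to the control percentage."""
    if control_percent <= 0:
        raise ValueError("control percentage must be positive")
    return test_percent / control_percent


if __name__ == "__main__":
    ratio = fold_change(DOUBLE_NULL_POSITIVE_PERCENT, WILD_TYPE_POSITIVE_PERCENT)
    print(f"fold increase in phosH3-positive cells: {ratio:.1f}")
```

With the approximate percentages from the text, the ratio is about 3.1, in line with the reported increase of about 3-fold.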

[0087] Fig. 7A shows an interphase chromatin staining with anti-phosH3 antibodies and CY3-conjugated secondary
antibodies. DNA was counterstained with DAPI. At least 1,000 cells were counted to evaluate the percentage (indicated
in the Figure) of anti-phosH3 positive cells. Fig. 7B shows a quantitative Western analysis with 15 µg and 30 µg of total
nuclear proteins immunoblotted with anti-H3 and anti-phosH3 antibodies.

<110> Boehringer Ingelheim International GmbH

<120> Method for identifying compounds altering higher-order chromatin
dependent chromosome stability

<130> Case 14/055 

<140> 
<141>

<160> 9

<170> PatentIn Ver. 2.1



<210> 1 

<211> 1452 

<212> DNA

<213> Mus musculus

<220>

<221> 3'UTR
<222> (1)..(18)

<220>

<221> CDS

<222> (19)..(1452)

<400> 1 

gaatgaaagc tccgcaag atg gcg acg gcc agg gcc aag gca cgg ggc agt 51

Met Ala Thr Ala Arg Ala Lys Ala Arg Gly Ser
1 5 10

gag gca gga gcg cgg tgt cac cgg gct cca ggt ccg ccc ccg agg ccc 99
Glu Ala Gly Ala Arg Cys His Arg Ala Pro Gly Pro Pro Pro Arg Pro
15 20 25

aag gcc agg cga acg gcg aga cgc cgc cgc gcg gag acc ctg acg gcg 147
Lys Ala Arg Arg Thr Ala Arg Arg Arg Arg Ala Glu Thr Leu Thr Ala
30 35 40

cga cgc tcg cgg ccg tct gcg ggc gag agg cgc gcc ggc tcc cag cga 195
Arg Arg Ser Arg Pro Ser Ala Gly Glu Arg Arg Ala Gly Ser Gln Arg
45 50 55

gcg tgg tcc gga gct ccg cgg gcc gcg gtc ttt ggc gac gag tgt gca 243
Ala Trp Ser Gly Ala Pro Arg Ala Ala Val Phe Gly Asp Glu Cys Ala
60 65 70 75

cga ggt gcc tta ttc aag gcc tgg tgt gtg cct tgc cta gtt tca ctt 291
Arg Gly Ala Leu Phe Lys Ala Trp Cys Val Pro Cys Leu Val Ser Leu
80 85 90

gat act ctc cag gaa tta tgt aga aaa gaa aag ctc aca tgt aaa tcg 339
Asp Thr Leu Gln Glu Leu Cys Arg Lys Glu Lys Leu Thr Cys Lys Ser
95 100 105

att gga atc acc aaa agg aat cta aac aat tat gag gtg gag tac ttg 387
Ile Gly Ile Thr Lys Arg Asn Leu Asn Asn Tyr Glu Val Glu Tyr Leu
110 115 120

tgt gac tac aag gta gca aag ggt gtg gaa tat tat ctt gta aaa tgg 435
Cys Asp Tyr Lys Val Ala Lys Gly Val Glu Tyr Tyr Leu Val Lys Trp
125 130 135

aaa gga tgg cca gat tct aca aac acc tgg gag ccc ttg aga aac ctc 483
Lys Gly Trp Pro Asp Ser Thr Asn Thr Trp Glu Pro Leu Arg Asn Leu
140 145 150 155

agg tgt cca cag ctc ctg cgg cag ttc tct gat gac aag aag act tac 531
Arg Cys Pro Gln Leu Leu Arg Gln Phe Ser Asp Asp Lys Lys Thr Tyr
160 165 170

tta gct cag gaa agg aaa tgc aag gct gtc aat tca aaa tcc ttg caa 579
Leu Ala Gln Glu Arg Lys Cys Lys Ala Val Asn Ser Lys Ser Leu Gln
175 180 185

cct gca att gct gag tat att gta cag aaa gct aag caa aga ata gct 627
Pro Ala Ile Ala Glu Tyr Ile Val Gln Lys Ala Lys Gln Arg Ile Ala
190 195 200

ctg cag aga tgg caa gat tac ctc aac aga aga aag aac cat aag ggg 675
Leu Gln Arg Trp Gln Asp Tyr Leu Asn Arg Arg Lys Asn His Lys Gly
205 210 215

atg ata ttt gtt gaa aac act gtt gac ttg gag ggc cca cct tta gac 723
Met Ile Phe Val Glu Asn Thr Val Asp Leu Glu Gly Pro Pro Leu Asp
220 225 230 235

ttc tac tac att aac gag tac agg cca gct ccc ggg atc agc ata aac 771
Phe Tyr Tyr Ile Asn Glu Tyr Arg Pro Ala Pro Gly Ile Ser Ile Asn
240 245 250

agt gaa gcc acc ttt gga tgt tca tgt aca gac tgc ttc ttt gac aag 819
Ser Glu Ala Thr Phe Gly Cys Ser Cys Thr Asp Cys Phe Phe Asp Lys
255 260 265

tgt tgt cct gct gaa gct gga gtt gtg ttg gct tat aat aag aag caa 867
Cys Cys Pro Ala Glu Ala Gly Val Val Leu Ala Tyr Asn Lys Lys Gln
270 275 280

caa att aaa atc caa cca ggc act ccc atc tac gaa tgc aac tca agg 915
Gln Ile Lys Ile Gln Pro Gly Thr Pro Ile Tyr Glu Cys Asn Ser Arg
285 290 295

tgt cga tgt gga cct gaa tgt ccc aat agg att gta caa aaa ggc aca 963
Cys Arg Cys Gly Pro Glu Cys Pro Asn Arg Ile Val Gln Lys Gly Thr
300 305 310 315

caa tat tca ctg tgc atc ttt aaa act agc aat ggc tgt ggt tgg ggt 1011
Gln Tyr Ser Leu Cys Ile Phe Lys Thr Ser Asn Gly Cys Gly Trp Gly
320 325 330

gta aaa acc ctt gtg aag att aaa aga atg agt ttt gtc atg gaa tat 1059
Val Lys Thr Leu Val Lys Ile Lys Arg Met Ser Phe Val Met Glu Tyr
335 340 345

gtt gga gag gtg atc aca agt gaa gag gcc gag aga cgg gga cag ttc 1107
Val Gly Glu Val Ile Thr Ser Glu Glu Ala Glu Arg Arg Gly Gln Phe
350 355 360

tat gac aac aaa ggg atc acc tac ctc ttt gac ctg gac tac gag tct 1155
Tyr Asp Asn Lys Gly Ile Thr Tyr Leu Phe Asp Leu Asp Tyr Glu Ser
365 370 375

gat gag ttc aca gtg gat gca gct cga tat gga aac gta tcc cat ttt 1203
Asp Glu Phe Thr Val Asp Ala Ala Arg Tyr Gly Asn Val Ser His Phe
380 385 390 395

gtg aat cat agt tgt gac cca aat ctt cag gtg ttt agt gtt ttc atc 1251
Val Asn His Ser Cys Asp Pro Asn Leu Gln Val Phe Ser Val Phe Ile
400 405 410

gat aac ctt gat act cgg ctg ccc agg ata gca ttg ttc tct aca aga 1299
Asp Asn Leu Asp Thr Arg Leu Pro Arg Ile Ala Leu Phe Ser Thr Arg
415 420 425

acc ata aac gct gga gaa gag ctg act ttt gac tat caa atg aaa ggt 1347
Thr Ile Asn Ala Gly Glu Glu Leu Thr Phe Asp Tyr Gln Met Lys Gly
430 435 440

tct gga gaa gca tct tca gac tcc att gac cac agc cct gcc aaa aaa 1395
Ser Gly Glu Ala Ser Ser Asp Ser Ile Asp His Ser Pro Ala Lys Lys
445 450 455

agg gtc aga acc caa tgt aaa tgt gga gcc gag act tgc aga ggt tac 1443
Arg Val Arg Thr Gln Cys Lys Cys Gly Ala Glu Thr Cys Arg Gly Tyr
460 465 470 475

ctc aac tga 1452
Leu Asn



<210> 2
<211> 477
<212> PRT

<213> Mus musculus
<400> 2

Met Ala Thr Ala Arg Ala Lys Ala Arg Gly Ser Glu Ala Gly Ala Arg
1 5 10 15

Cys His Arg Ala Pro Gly Pro Pro Pro Arg Pro Lys Ala Arg Arg Thr
20 25 30




EP 1 162 274 A1 



Ala Arg Arg Arg Arg Ala Glu Thr Leu Thr Ala Arg Arg Ser Arg Pro
35 40 45

Ser Ala Gly Glu Arg Arg Ala Gly Ser Gln Arg Ala Trp Ser Gly Ala
50 55 60

Pro Arg Ala Ala Val Phe Gly Asp Glu Cys Ala Arg Gly Ala Leu Phe
65 70 75 80

Lys Ala Trp Cys Val Pro Cys Leu Val Ser Leu Asp Thr Leu Gln Glu
85 90 95

Leu Cys Arg Lys Glu Lys Leu Thr Cys Lys Ser Ile Gly Ile Thr Lys
100 105 110

Arg Asn Leu Asn Asn Tyr Glu Val Glu Tyr Leu Cys Asp Tyr Lys Val
115 120 125

Ala Lys Gly Val Glu Tyr Tyr Leu Val Lys Trp Lys Gly Trp Pro Asp
130 135 140

Ser Thr Asn Thr Trp Glu Pro Leu Arg Asn Leu Arg Cys Pro Gln Leu
145 150 155 160

Leu Arg Gln Phe Ser Asp Asp Lys Lys Thr Tyr Leu Ala Gln Glu Arg
165 170 175

Lys Cys Lys Ala Val Asn Ser Lys Ser Leu Gln Pro Ala Ile Ala Glu
180 185 190

Tyr Ile Val Gln Lys Ala Lys Gln Arg Ile Ala Leu Gln Arg Trp Gln
195 200 205

Asp Tyr Leu Asn Arg Arg Lys Asn His Lys Gly Met Ile Phe Val Glu
210 215 220

Asn Thr Val Asp Leu Glu Gly Pro Pro Leu Asp Phe Tyr Tyr Ile Asn
225 230 235 240

Glu Tyr Arg Pro Ala Pro Gly Ile Ser Ile Asn Ser Glu Ala Thr Phe
245 250 255

Gly Cys Ser Cys Thr Asp Cys Phe Phe Asp Lys Cys Cys Pro Ala Glu
260 265 270

Ala Gly Val Val Leu Ala Tyr Asn Lys Lys Gln Gln Ile Lys Ile Gln
275 280 285

Pro Gly Thr Pro Ile Tyr Glu Cys Asn Ser Arg Cys Arg Cys Gly Pro
290 295 300

Glu Cys Pro Asn Arg Ile Val Gln Lys Gly Thr Gln Tyr Ser Leu Cys
305 310 315 320

Ile Phe Lys Thr Ser Asn Gly Cys Gly Trp Gly Val Lys Thr Leu Val
325 330 335

Lys Ile Lys Arg Met Ser Phe Val Met Glu Tyr Val Gly Glu Val Ile
340 345 350

Thr Ser Glu Glu Ala Glu Arg Arg Gly Gln Phe Tyr Asp Asn Lys Gly
355 360 365

Ile Thr Tyr Leu Phe Asp Leu Asp Tyr Glu Ser Asp Glu Phe Thr Val
370 375 380

Asp Ala Ala Arg Tyr Gly Asn Val Ser His Phe Val Asn His Ser Cys
385 390 395 400

Asp Pro Asn Leu Gln Val Phe Ser Val Phe Ile Asp Asn Leu Asp Thr
405 410 415

Arg Leu Pro Arg Ile Ala Leu Phe Ser Thr Arg Thr Ile Asn Ala Gly
420 425 430

Glu Glu Leu Thr Phe Asp Tyr Gln Met Lys Gly Ser Gly Glu Ala Ser
435 440 445

Ser Asp Ser Ile Asp His Ser Pro Ala Lys Lys Arg Val Arg Thr Gln
450 455 460

Cys Lys Cys Gly Ala Glu Thr Cys Arg Gly Tyr Leu Asn
465 470 475



<210> 3 

<211> 543 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<222> (1)..(543)

<223> EST Acc. No. 173625

<400> 3

ggccatgtgg ttgancccct ggntttaccn nnccntggnn ggnnttgann ccccttagat 60
tatagtccag aatcattgtt gtcatataac tgccctcatc tttcagcttc gtcacttgtg 120
attacctttc caacttattc catgacaaaa cttattcttt taatcttcac atgggttttt 180
acaccccagc catggtcatt gatactgtga aagatgcaaa gtgaattact gtgtgccttt 240
ttgtacaatc ctattggtac agtgaggtcc acattgacag attgagatgc atttatagat 300
gggagtaaca ggtgggattt taatttgttg gtttttacta taagccaaaa gaattccagc 360
ttcaccaaga caacattttt catagaagca atctctgcat gaacaacaaa aggtagcttc 420
atttactaag ctgattccag gagctggttt gtattcatca atatagcaga agtctgaagg 480
tgggccttct aagtgaaccc tattntcaac aaatatcact cctttattat tctgtcttct 540
gcg 543



<210> 4
<211> 579
<212> DNA

<213> Homo sapiens
<220>

<221> misc_feature

<222> (1)..(579)

<223> EST Acc. No. AQ494637

<400> 4

gcttctcata catgatacgt gttcngctct gnngtntnng tttangaata cntaaaanaa 60
aaggnagggg ngncntttga ttcgtgtgat tccatagatg cactcatatg gaactgtatt 120
tcattntgtg aatcatagta gtgacccaaa tcttcatatg ttctatgntn tcactgataa 180
cttgacactg gccttcccta tatagctctg tgttccatga gaactataaa tgctggagaa 240
gagttgattt ttgacaatca aacaaaaagt tctggggata tatcttcaga gtttattgac 300
cacagctcag ccaaaaagag ggtcagaact gtatgtaaat gtggagctgt gacttgcaga 360
ggttgcctca aatgaatttt caggaaatag aaatgatgat aattggtagt tgtttctttt 420
ttctaatgtt atcattctaa aaataagtat ttggaactct cttttcatat tatcaagatt 480
attactatgt taaattgaca tncatggttc aaggcattta ccanatgcat tactgatgcc 540
tcttgagaga gggccactgt gttgcataga ctgatctga 579



<210> 5 
<211> 565 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature

<222> (1)..(565)

<223> EST Acc. No. AQ691972

<400> 5

agaggatgag catggatcnt cgctatagca aaccacanat anaatcccac ctgttactcc 60
catctataaa tgcatctcaa tctgtcaatg tggaccttac tgtaccaata ggattgtaca 120
aaaaggcaca cagtaattca ctttgcatct ttcacagtat caatgaccat ggctggggtg 180
taaaaaccca tgtgaagatt aaaagaataa gttttgtcat ggaataagtt ggaaaggtaa 240
tcacaagtga cgaagctgaa agatgagggc agttatatga caacaaatga tctggactat 300
gaatctgatg aattcacaga ggatgcagct caatatggaa ctgtatttca ttntgtgaat 360
cataagtagt gacccaaact tcatatgttc aatcttntca ttgataactt gacactggcc 420
tttccttaat agctctgtgt tccatgagaa ctataaatgc tggagaagaa gtgatttttg 480
acatcaacaa aagttctggg attatcttca aagttattgc cacagttacc aaaagaaggc 540
aaactgttgt aatgtgagct gtact 565



<210> 6 
<211> 535 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<222> (1)..(535)

<223> EST Acc. No. AQ554070

<400> 6

tcagactcat agtccagatc aaagagattc tgtgattccc ttgttgtcat agaactgtcc 60 
tcgtctttca gcttcttcac ttgtgattac ctaaacagaa aaaactgtaa gtatattacg 120 
tagctactga accaaagaag cattcatcta cctatctact aatatgcgaa tacctacaaa 180 
tatttaaaaa gtaagaaatt caggtgtcat caaagcaaac attcacacaa actaagactc 240 
agatgcaaag aggtgggaaa atgaggggaa gaaaaatgat aatgcaaaag actgatgacc 300 
tttttttttt aaacagggtc tcactctgtc actcaggcta gaatgcggtg gtgccatcat 360 
gactccctgt atcctttaac tcctgggatc aagcgatctt cctgcctcag cctcctgact 420 
agctggatca caggtgcata ccgccatgcc cagctaatga tttagttttt atagagatgt 480 
ggggtctcac tatgttgccc acactggtct ggaactcctg ggctcaagtg agcct 535 



<210> 7 
<211> 20 
<212> PRT 

<213> Homo sapiens
<400> 7

Ala Arg Thr Lys Gln Thr Ala Arg Lys Ser Thr Gly Gly Lys Ala Pro
1 5 10 15

Arg Lys Gln Leu
20



<210> 8

<211> 20

<212> PRT

<213> Homo sapiens

<400> 8

Met Gly Pro Arg Arg Arg Ser Arg Lys Pro Glu Ala Pro Arg Arg Arg Ser Pro Ser Pro
1 5 10 15 20



<210> 9
<211> 22
<212> PRT

<213> Rattus rattus
<400> 9

Met Glu Thr Ser Ser Arg Gly Gly Lys Lys Lys Ser Thr Lys Thr Ser Arg Ser Ala Lys
1 5 10 15 20

Ala Gly 
22 



Claims 

1. A method for identifying a compound that alters higher order chromatin dependent chromosome stability, said 
method comprising incubating a substrate for a methyltransferase, in the presence of a methyl donor, with a 
methyltransferase with Suv39h-like methyltransferase activity, in the presence or absence of a test compound and 
determining whether the compound modulates the methyltransferase activity. 

2. The method of claim 1, wherein the methyltransferase with Suv39h-like activity is murine Suv39h1 or human 
SUV39H1. 

3. The method of claim 1, wherein the methyltransferase with Suv39h-like activity is murine Suv39h2 or human 
SUV39H2. 

4. The method of any one of claims 1 to 3, wherein the substrate is histone H3 or N-terminal fragment thereof that 
contains the methylation site at lysine 9. 

5. The method of claim 4, wherein the histone H3 N-terminal fragment has the amino acid sequence as set forth in 
SEQ ID NO:7. 

6. The method of any one of claims 1 to 5, wherein the methyl donor is radio-labelled methionine or 
S-adenosyl-L-methionine. 

7. The method of any one of claims 1 to 6, wherein the methyl donor carries a chromogenic label and the 
methyltransferase activity is determined by measuring the change in colour upon transfer of the methyl group to the 
substrate. 

8. The method of any one of claims 1 to 6, wherein the methyl donor carries a radioactive label and the 
methyltransferase activity is determined by measuring the radioactivity transferred to the substrate upon transfer of the 
methyl group. 


9. The method of any one of claims 1 to 6, wherein the methyltransferase activity is determined immunologically by 
quantifying the binding of an antibody specific for the methylation site to the substrate. 






10. An antibody that specifically recognises methylated lysine 9 in histone H3. 

11. A compound identified in a method defined in any one of claims 1 to 9 for the therapy of cancer. 

Fig. 2/2 
Fig. 3/2 
Fig. 4/1 
Fig. 4/2 
Fig. 4/3 